WOW: the Hungarian Deep Web Searcher
نویسندگان
چکیده
This paper summarizes the goals and presents the results of our ongoing research and development project, called “In the Web of Words” (WOW), funded by the National R+D Program in Hungary. The project aims at creating a complex search interface that incorporates — beside the usual keyword-based search functionality—deep web search, Hungarian natural language (NL) question processing, image search support by visual thesaurus. In this paper we focus on system architecture and NL processing. One of the most crucial part of the system is the transformation of NL questions to adequate SQL queries that is in accordance with schema and attribute convention of contracted partner databases. This transformation is performed in three steps: NL question processing, context recognition, and SQL transformation.
منابع مشابه
Entity Recognizer in Hungarian Question Processing
In our ongoing research and development project, called “In the Web of Words” (WoW), funded by the National R+D Program in Hungary, we aim to create a complex search interface that incorporates— beside the usual keyword-based search functionality—(1) deep web search, (2) Hungarian natural language question processing, (3) image search support by visual thesaurus. This paper focuses on a particu...
متن کاملOn the Transformation of Sentences with Genitive Relations to SQL Queries
In our ongoing project called “In the Web of Words” (WoW) we aimed to create a complex search interface that incorporates a deep web search engine module based on a Hungarian question processor. One of the most crucial part of the system was the transformation of genitive relations to adequate SQL queries, since e.g. questions begin with “Who” and “What” mostly contain such a relation. The geni...
متن کاملCross-Lingual Image Search on the Web
Most people locate images on the Web by querying image search engines such as Google’s. The images are tagged by the words in their “vicinity”, which limits the ability of a searcher to retrieve them. Although images are universal, an English searcher will fail to find images tagged in Chinese, and a Spanish searcher will fail to find images tagged in English. Cross-lingual homonyms cause probl...
متن کاملDetermining Relevant Deep Web Sites by Query Context Identification
Deep web search requires a transformation between search keywords and semantically described and well-formed data structures. We approached this problem in our “In the Web of Words” (WoW) project by allowing natural language sentence queries and by a context identification method that connects the queries and deep web sites via database information. In this paper we propose a novel SQL based ap...
متن کاملAnomaly-based Web Attack Detection: The Application of Deep Neural Network Seq2Seq With Attention Mechanism
Today, the use of the Internet and Internet sites has been an integrated part of the people’s lives, and most activities and important data are in the Internet websites. Thus, attempts to intrude into these websites have grown exponentially. Intrusion detection systems (IDS) of web attacks are an approach to protect users. But, these systems are suffering from such drawbacks as low accuracy in ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004